Higher order gravity theories and scalar tensor theories 
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We generalize the known equivalence between higher order gravity theories and scalar tensor 
theories to a new class of theories. Specifically, in the context of a first order or Palatini variational 
principle where the metric and connection are treated as independent variables, we consider theories 
for which the Lagrangian density is a function / of (i) the Ricci scalar computed from the metric, 
and (ii) a second Ricci scalar computed from the connection. We show that such theories can be 
written as tensor-multi-scalar theories with two scalar fields with the following features: (i) the two 
dimensional cr-model metric that defines the kinetic energy terms for the scalar fields has constant, 
negative curvature; (ii) the coupling function determining the coupling to matter of the scalar fields 
is universal, independent of the choice of function /; and (iii) if both mass eigenstates are long 
ranged, then the Eddington post- Newtonian parameter 7 has value 1/2. Therefore in order to be 
compatible with solar system experiments at least one of the mass eigenstates must be short ranged. 



I. INTRODUCTION AND SUMMARY 

A well-known class of theories of gravity that mod- 
ify general relativity can be obtained by taking the La- 
grangian density to be some nonlinear function / of the 
Ricci scalar. The action for such theories is 



-gf{R)+Sm[g^.,ijm], (1.1) 



where g^^^ is the metric (which we have barred for later 
notational convenience), R is its Ricci scalar, / is the 
function, and = SttG. The second term is the matter 
action 5*111, which is some functional of the matter fields 
"0111 and of the metric 5^,^ ^ . The theory (|l.l(l is equivalent 
to ^ a scalar-tensor gravity theory . 

A second class of theories of gravity is given by re- 
interpreting the action Hl.l|l in terms of a Palatini or first- 
order variational principle 9]. In this reinterpretation, 
one considers the action to be a functional of the matter 
fields iprm a metric g^j/, and a connection which is 
independent of the metric. The resulting action is 



-gfiR) + s^[g^t,,ipn 



where R — 5^" R^y and R^u 



(1.2) 

is the Ricci tensor of the 
connection V^. When f{R) — R (the case of general 
relativity), the action H1.2|l gives rise to the same equa- 
tions of motion as the theory 1)1.1(1 based on the standard 
or second order variational principle. However, for more 



general functions .f(R), the theories 1)1. l|l and H1.2|l are 
not equivalent 

Modifications of gravityof bot h ty pes II 1.1 II and H1.2|) 
have been suggested 0, IH IH 111 Q Ell to explain 
the observed recent acceleration of the Universe's expan- 
sion 0, 0. However, there are as yet no successful 
models along these lines. Theories of the type (|l.l|l con- 
tain a scalar field which couples to matter with the same 
strength as does gravity, as pointed out by Chiba 0|. 
These theories are thus ruled out by solar system exper- 
iments'^ unless the scalar field is short ranged, and in 
that case it is difficult to obtain an accelerating Universe 
[l8l |. Theories of the type 1)1.2)1 are actually not modifi- 
cations of gravity at all despite appearances. Instead, 
these theories, when written in a canonical form, contain 
a scalar field with no kinetic energy energy term. When 
one integrates out the scalar field one obtains general 
relativity coupled to a modified matter action, not the 
original matter action St-n[gfj,u,ipm] that appears in Eq. 

This paper is based on the observation, due to Nima 
Arkani-Hamed that theories of the form ()1.2)) are 
unstable to matter loop corrections. Specifically, matter 
loops will give rise to a correction to the action 1)1.2)1 pro- 
portional to the Ricci scalar R of the metric g^j^. In gen- 
eral relativity these corrections merely renormalize New- 
ton's constant of gravitation. Here, however, such cor- 
rections give rise to a Lagrangian density for the gravita- 
tional action which is no longer a function of R alone, but 
instead is a function of both R and R, the Ricci scalars 



^ We use units in which h = c = 1, and we use the sign conventions 
of Ref. Q. 

^ The equivalence was first shown, in the context of a specific 
choice of /, by Higgs 01, and later independently by several dif- 
ferent researchers including Bicknell Teyssandier and Tour- 
renc |^ , and Whitt |^ . The generalization to arbitrary functions 
/ was first given by Schmidt 0. Subsequently Wands |3 once 
again independently discovered the equivalence, and indepen- 
dently derived the result for arbitrary /. 



It may be possible to evade this constraint for certain choices 
of the function f{R). For example, for f{R) oc _R^(ln_R)", the 
potential V{^) for the scalar field is of the form V{^) oc l/^" 
for large <&. For this potential, and for the matter coupling for $ 
that follows from the action 11.11 . there exists a regime in which 
the coupling of t he s calar field to large objects is suppressed by 
nonlinear effects [l^. However, it is not yet clear if this class of 
"chameleon field" theories give acceptable cosmologies. 
* This statement is valid classically. However quantum mechani- 
cally things may be more complicated. 
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of the connection and of the metric. Thus, the form of 
the gravitational action 1)1. 2|l is not preserved under loop 
corrections, and there is a fine-tuning problem inherent 
in assuming an action of the form (|1.2|) . 

That fine tuning problem is also apparent in the al- 
ternative description of the theory (|1.2(l . derived in Ref. 
[20l|, as a type of scalar-tensor type theory without any 
kinetic energy term for the scalar field [21|. The action 
of that alternative description is 



(1.3) 



where $ is the auxiliary scalar field with cou- 
pling function a($) and potential V{'^), and g^i, = 
exp[— 2a(<I>)]^^i^ is the Einstein frame metric. In terms of 
these variables, the correction term due to matter loops is 
proportional to = e~^"[i? — 6(Vq;)^ — GDa]. Therefore 
matter loops induce a kinetic term for the field $, which 
becomes a dynamical field, and the loop-corrected theory 
is a genuine modification of general relativity, unlike the 
original theory H1.2|l or H1.3|l . 

These considerations suggest that, in the context of the 
first-order variation formalism, it is natural to consider a 
more general class of theories of gravity of the form 



(1.4) 

Here the Lagrangian density of the gravitational action is 
some function / of both the Ricci scalar R of the metric 
g^v and the Ricci scalar R of the connection V^^. These 
theories are stable under loop corrections, which just give 
rise to corrections to the function f{R,R). The purpose 
of this paper is to analyze the class of theories 1)1.4(1 , and 
to derive for these theories an equivalent description, at 
the classical level, as tensor-biscalar theories. 



A. Summary of results 

We now turn to a description of our results. We define 

_ . df - - - - df - ' 

hiR,R) = jrL{R,R), f,{R,R)^-'(R,R), (1.5) 
oR dR 

and we define the signs 

ei=sign/i, £2 = sign/2, (1.6) 

and 

£3 = sign(/i + /2). (1.7) 

Our construction in Sec. ITU below shows that if ei, £2 
and £3 are constant in an initial data set, then they will 
be constant throughout the corresponding future Cauchy 
evolution. We restrict attention to this portion of phase 



space, in which the signs £1, £2 and £3 are constants, 
independent of space and time. The reason for this re- 
striction is that in regions of phase space where one or 
more of these variables flips sign somewhere in spacetime, 
we suspect that the theory does not posses a well posed 
initial value formulation ^. Assuming that £1, £2, and £3 
are constants, we show below that in order for the theory 
not to contain ghosts or negative energy excitations, we 
must have 



and 



£1 = 1, 



£2 



£3 = 1- 



(1.8) 



(1.9) 



(1.10) 



Next, we further restrict the region of phase space un- 
der consideration, and the set of allowed functions /, as 
follows. We define A{R,R) to be the determinant of the 
matrix of second order partial derivatives of / with re- 
spect to R and R. We assume that there exists an open 
domain T> of points {R, R) for which the following three 
conditions are satisfied: (i) We have 



A{R,R) ^ 



(1.11) 



for all points {R, R) in 1); (ii) The conditions (|1.8() - 
(|1.10() are satisfied for all points {R, R) in V; and (iii) 
Equations H1.16|l and (|1.17|l below define a bijection be- 
tween T> and some open domain V of field values ($, ^'). 
Then, the theory (|1.4() with initial data in the domain T) 
is equivalent to a tensor-biscalar theory with field values 
in the domain V . 

We next describe this tensor-biscalar theory. The ac- 
tion for a general tensor-multi-scalar theory is the fol- 
lowing functional of the matter fields tpm, the Einstein- 
frame metric Oujy, and a N-tuple of scalar fields = 



(1.12) 



^ The reason for this suspicion is that points in spacetime of at 
which £1 , £2 or £3 Aip sign are points at which one of the three 
kinetic terms in the action l2.2Ui below vanishes. The initial 
value formulation is known to break down in other, simpler con- 
texts in which this type of phenomenon occurs. For example, the 
theory S = J d'^Xy/=^[R/(2K'^) - (V$)2/2 - ^ij*^] has a good 
initial value formulation in the region of phase space < 3>c, 
where <I>c = 1/ \/2K?li,, but not once $ crosses the value $0 
since $ = <3?c, Sfiv = (anything) is a solution of the equations of 
motion. 
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This action is characterized by the coupUng function 
a{^^) and potential V{^^) of the scalar fields, and by 
the (T- model metric jab{^^)- For the theory 1)1.4(1 we 
have AT = 2, $^ = = ^j^g cr-model met- 

ric is given by 



V6 



and the coupling function is 



a($, ^I^) = + In cosh ( 

The potential is given by the formula 



(1.14) 



v{<^, *) - ^^^^ [^/i ^) + ^/2((p, V) - fiv, V')] • 

(1.15) 

Here we have introduced two additional scalar fields tp 
and ip, which are given in terms of $ and \1/ by the equa- 
tions ^ 



= exp 



V6 . 



and 



l/2(y^»| 



tanh 



V6 



(1.16) 



(1.17) 



For specific choices of the function /(i?, R) it is 
straightforward to compute the potential V from Eqs. 
(fTTBI) - (fTTTIl . For example, if / = i? - J? -h J^iJ/m^ we 
have y = m?u{l — u)/{2k^), where 



u — cosh^ ( —;=^ 
,V6 



1 - cxp ( 2^* 



(1.18) 



II. DERIVATION OF EQUIVALENCE 

Consider the gravitation theory given by the action 
(|1.4|) . An action which is equivalent to this at the classi- 
cal level can be obtained by using the technique of Refs. 
0,13 of introducing auxiliary scalar field(s). Specifically, 
we introduce two scalar fields (/? and ■0, and we define the 
action 



+ {R~ + {R- ij)f2i^,ij) 

mj ■ 



/(^,V') 



(2.1) 



The loft hand sides of Eq. 11.171 is less than one by Eqs. ILTI 
and ll.lUi . so it is possible to solve this equation for <&. 



Then the equations of motion for ip and ip enforce ip = R, 
ip = R, as long as 



det 



^0. 



(2.2) 



The condition 1(2. 2|l will always be satisfied because of 
our assumption p. 11(1 above. Therefore the actions p.4() 
and 1(2.11) are classically equivalent. 

We write the action a.s S = Si + S2 + S3 + 5'm, 
where 



2k2 



^2 = TT / d^x^Rhiv.^). 



2^2 



and 



2k2 



(2.3) 



(2.4) 



(2.5) 



The action Si can be rewritten as an Einstein- frame form 
using the following standard procedure. We define the 
scalar field p by 



fi{(p,ip) = eie" 



-2p 



(2.6) 



where ei = ±1 [cf. Eq. 1(1.6(1 above]. Defining the confor- 
mally transformed metric 

g^,^ = exp[-2p]g^i„ (2.7) 
and using R = e~^P[R — 6(Vp)2 — 60^], we obtain 



£1 
2k^ 



'9 



R-GiVpY' 



A similar transformation can be carried out for the 
action ^2. We define the scalar field a by 



f2{'P,1p) = £26 



(2.9) 



where £2 = ±1 [cf- Eq. p.6() above]. Now the Ricci scalar 
R that appears in the action ^2 depends both on the 
metric g^^, and on the connection via 



R = g^'^ Rfiv, 



(2.10) 



where is the Ricci tensor of the connection V^. Un- 
der conformal transformations of the metric that preserve 
the connection, this quantity will undergo a simple, mul- 
tiplicative transformation coming from the first factor in 
Eq. itTTUIl . Thus, if we define 



g^,u = exp[-2cr]5^,. 



(2.11) 



we obtain 



£2 



S2[g^u,^^.,vM - 1^ / d^x^-g{r''R^..)■ (2.12) 
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Next, we note that the total action p.lfl depends on 
the connection only through the term H2.12() . Also 
the action 12.12|) is just the Palatini- variation version of 
the Einstein- Hilbcrt action of general relativity. There- 
fore, the equation of motion for stipulates that 
is the connection determined by the metric (jp,iy- In 
the classical theory, we are free to substitute the relation 
V;^ = back into the action (|2.12l) to obtain 



£2 



gR, 



(2.13) 



where R is the Ricci scalar of the metric ^^y. In other 
words, we obtain an equivalent theory at the classical 
level by replacing the Palatini-variation action ^2 with 



its standard- variation version 5*2 . 



Next, we write the action (|2.13|l in terms of the metric 
g^^ using the definitions H2.7|l and H2.11|l . This gives 



'S'2[ffMi/,'/3, V'] 

where 



g2 

2«;2 



T = p — a. 



e2"i? + 6e2"(Vr)2 , 
(2.14) 

(2.15) 



Adding this to the action (|2.8|) and rewriting in terms of 
the conformally transformed metric 



(2.16) 



gives 



Si + S'2 = 



2^2 



^\ 6£2e2^e2"(Vr)2 



+e^^{e, + £26"") [R - 6(Vx)' - GOx] 
-6eie^^{Vp)A. (2.17) 



Next, it follows from the definitions H2.6|l . (|2.9|l and 
(12.15(1 of p, a and r and the definition (|l.lUfl of £3 that 



£3 = sign(£i -I- £26 "■) 



(2.18) 



We now choose the conformal factor e^-^ entering into the 
definition H2.16|l of the metric g^^, to satisfy 



e^^\ei+e2e^^\ = l. 



(2.19) 



Using Eqs. 1)2. 18() and (|2.19(l the action p. 17(1 simplifies 



to 



2k 



esR- 



6£i£3 



£1 



6£i£2£3e^'^ 

(£1 -|-£2e2^): 



r(Vr)= 



(2.20) 



Now the sign of the coefficient of the Ricci scalar R in 
the action H2.20|l must be positive, otherwise the matter 



action 5,n will contain fields whose kinetic terms have 
the wrong sign relative to the gravitational action, giv- 
ing rise to an unstable theory. This yields the condition 
(|1.1UI) . Similarly, requiring that the last two terms in the 
action H2.20|l have their conventional signs ^ and using 
the relation 1)2.18)1 yields the conditions 1)1.8)1 and 1)1.9)1 . 
Next, we define the fields <& and ^' by 



and 



\/6 

$ = sign(T) tanh^^ 



T ^ 
* = p. 

K 



It follows that 



£1 +£26 



= cosh^(K$/\/6), 



(2.21) 



(2.22) 



(2.23) 



and substituting into the action ()2.20)) reproduces the 
first and third terms of the action ()1.12)l . The last term 
in the action ()1.12l) is just the fourth term in the action 
()2.1)) where we have used the relation 



gfiu — 6 gfiu 



(2.24) 



between the Jordan-frame metric g^^i, and the Einstein 
frame metric g^^. In order to compute the coupling func- 
tion a that is defined by the relation ()2.24l) . we combine 
the definitions ()2.7)l and ()2.16)) of g^i/ and together 
with the formula 1)2.19)1 for the conformal factor e^^. This 
yields 



=2p 



£1 + £26^ 



(2.25) 



and combining this with Eqs. ()2.22)) and ()2.23)) yields 
the formula 1)1.14)1 . Finally, the expression ()1.15l) for the 
potential is obtained by combining Eqs. ()2.5I) and 1)2. 24)1 . 
and the formulae ()1.16)l and 11.17)1 for ip and ip in terms 
of $ and 5* can be obtained from the definitions ()2.t))l . 
(E3, lfTT3|l . and ifT^ . 

Finally, we note that the condition 1)1.10)1 together with 
Eqs. lfLH|l . (fO|l and ifTTHIl imply that r < 0, which im- 
plies from Eq. 1)2.21)1 that 



$ < 0. 



(2.26) 



^ If the sign of a scalar-field action relative to the gravitational 
action is flipped from its usual value, then the Hamiltonian of 
the theory is unbounded below and one expects the theory to 
be quantum mechanically unstable. Note however that in such 
theories the decay timescales may be long enough for the theory 
to make sense as an effective fiel d th eory over a certain range 
of scales, as suggested in Refs. 13. 1221 . In this paper we restrict 
attention to sectors where the signs of the scalar-field actions 
have the conventional value. 
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In other words, the two theories are equivalent in the fol- 
lowing sense: solutions of the original theory (|1.4|) can 
be mapped onto solutions of the tensor-biscalar theory 
(|1.12l) that satisfy 12.26|l . This restriction is somewhat 
puzzling as the condition (|2.26() is clearly not preserved 
under general dynamical evolution. The resolution of this 
puzzle is that if one starts with an initial data set satisfy- 
ing (|2.2t)|) and evolves forward using the tensor-biscalar 
theory until the condition (|2.26|l is violated, the locus of 
points at which <I> = corresponds to a singularity of 
the original variables g^i^, V^. This can be seen from 
Eq. (|1.17ll . since we have argued that £2 — sign(/2) can- 
not flip sign, so /i must pass through ±00 as $ passes 
through 0. 

We also note that the methods of this paper can be ap- 
plied to special choices of the function f{R, R) for which 
the quantity A(_R, R) vanishes identically, cf. Eq. 
above. In such cases one obtains a theory with one scalar 
field. The choice f{R,R) — R + g{R) is equivalent to a 
theory of the type (jTIJ with f{R) = R + g{R). The 
choice /(-R, R) = R + g{R) can be obtained from the 
above analysis with £1 = 1, p = (p = = 0. In this case 
the formula (|1.15|) for the potential becomes 

4q($) 

where tanh^(K$/V6) = \g'{i^)\- 



III. DISCUSSION AND IMPLICATIONS 

Consider first the compatibility of the theory (|1.12|) 
with solar system experiments. Lets assume that the 
background, present-day cosmological values of the fields 
$ and 5" are $0 and ^^q. Lets also assume, initially, that 
the potential V is such that the fields 5$ = $ — $0 and 
= i]/ _ vJ/q have small masses and are long ranged on 
solar system scales. The Eddington parameterized post- 
Newtonian (PPN) parameter 7 for theory is then given 

by i 



1 



2al 



where 



2 da da 



(3.1) 



(3.2) 



evaluated at the present-day background values of the 
fields. Evaluating the expression l|3.1|l using Eqs. (|1.13|) 
- (|1.14|l gives 7 = 1/2. This is in conflict with solar 
system VLBI observations which show that |2^ 



I7- 1| < 3 X 10" 



(3.3) 



It follows that the potential V{^, ^E") must be such that 
at least one of the two mass eigenstates for perturbations 



about ($Oj^o) is short ranged. Those mass eigenstates 
are determined from the equations j3| 



7ab($^) = E^^Ai 



(3.4) 
(3.5) 



where j labels the eigenstate, mj are the masses, and \/ a 
is the covariant derivative determined by the metric jab 
on field space. The formula H3.2|l is then replaced by 



E 



da 



(3.6) 



where J^'j denotes the sum over the light, long-ranged 
eigenstates. 

If there is one light and one heavy eigenstate, and if 
X is the angle in field space between the light eigenstate 
and the 9/9$ direction, then a short calculation using 
Eqs. (fTT^ . (fTT^ and (|33|) gives 



[sin X + cos X sinh(K<I>o /\/6)] 
3cosh^(K$o/V6) 



(3.7) 



Thus if X <C 1 and k$o ^ 1, then a^ ^ 1 and the solar 
system constraint (13. 3|) can be evaded. It is possible to 
find functions f{R, R) and values of ($oj ^0) for which 
X <C 1. This is because x is determined by the values of / 
and its derivatives up to third order at the point (i?o, i?o) 
that corresponds to ((f>o,^o), so that there many more 
free parameters than there are constraints. Thus, there 
exists a class of gravitation theories of this type that are 
compatible with solar system observations. 

It might be possible to further specialize the choice of 
f{R, R) to give a viable model of the recent accelera- 
tion of the Universe, along the lines suggested by Carroll 
et. al. However, in such a case, it would not be 

completely accurate to call the theory H1.4|) a modified 
theory of gravity, since the theory would contain a scalar 
field which acts mostly as a source of gravity and does 
not mediate any long range forces. Rather, the explana- 
tion for the acceleration of the Universe would in such 
a case be partly a modification of gravity and partly a 
quintessence-type source of gravity. 
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